HEAD <<<<<<< HEAD
This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com.
When you click the Knit button a document will be generated that includes both content as well as the output of any embedded R code chunks within the document. You can embed an R code chunk like this:
summary(cars)
## speed dist
## Min. : 4.0 Min. : 2.00
## 1st Qu.:12.0 1st Qu.: 26.00
## Median :15.0 Median : 36.00
## Mean :15.4 Mean : 42.98
## 3rd Qu.:19.0 3rd Qu.: 56.00
## Max. :25.0 Max. :120.00
You can also embed plots, for example:
Note that the echo = FALSE parameter was added to the code chunk to prevent printing of the R code that generated the plot.
For our project, our domain of interest is to observe the transition of standardized tests being a requirement for admission by universities. Specifically, we have narrowed our scope to analyze the relationship between average SAT scores and their relationship to income across zip codes in California. As an objective of our project, we would like to uncover some of the discepencies from area to area and how it affects students’ score. Thus, findings from our report will reveal whether standardized test are a true reflection of a student’s capabilities. The source of the data is from California Department Of Education and IRS website.
The summary includes some interesting analysis of how household income corelates to a student’s total SAT score. Based on our data, the mean and median household income for California is 62717 and 56646 respectively.The average SAT score for kids in below average income household was 1364.5 and for kids in above average income household was 1539.3.The zip code with the highest median household income was 95070.
The summary table portrays how total sat score and mean household income are different for different areas(Zip code) in California.
<<<<<<< HEAD ======= >>>>>>> a91661fcaa5558d6adbe920a451befd6906429fbFrom the table, we can clearly observe that zip codes in the beggining of the list are the areas with way less than average household income and low sat scores whereas zip codes towards the end of the list are the areas with way more average household incomes and higher sat scores.
The purpose of this visualization is to highlight the potential relationship between geographical locations and the average SAT performace. Using map visualization, we can get a better insight on location vs. SAT performance. Here is the visualization:
<<<<<<< HEAD ======= >>>>>>> a91661fcaa5558d6adbe920a451befd6906429fbFrom the map visualization we can see there seems to be a higher percentage of student passing the performance benchmark near San Francisco and Sacramento than other places. Student near Fresno seems to have lower performance than others.
The purpose of this visualization is to compare HS average SAT score and Median income by zip code. In particular, the key displays bar colors by zip code.
<<<<<<< HEAD ======= >>>>>>> a91661fcaa5558d6adbe920a451befd6906429fbFrom the bar chart we observe an upward trend as SAT score and median income increase. Towards the 1500 range in SAT score, not only do we see a concentration of data but the color key reflects that many of the zip codes are 955xx which is Humboldt county. Towards the upper boundary of highest SAT scores we see more 945xx zip codes which is Alameda county in the Bay area. This further confirms what we see from our map visualization.